Distributed Reorder Buffer Schemes for Low Power
نویسندگان
چکیده
We consider several approaches for reducing the complexity and power dissipation in processors that use separate register file to maintain the commited register values. The first approach relies on a distributed implementation of the Reorder Buffer (ROB) that spreads the centralized ROB structure across the function units (FUs), with each distributed component sized to match the FU workload and with one write port and two read ports on each component. The second approach combines the use of previously proposed retention latches and a distributed ROB implementation that uses minimally ported distributed components. The second approach avoids any read and write port conflicts on the distributed ROB components (with the exception of possible port conflicts in the course of commitment) and does not incur the associated performance degradation. Our designs are evaluated using the simulation of the SPEC 2000 benchmarks and SPICE simulations of the actual ROB layouts in 0.18 micron process.
منابع مشابه
An Energy Efficient Instruction Window for Scalable Processor Architecture
Modern microprocessors achieve high application performance at the acceptable level of power dissipation. In terms of power to performance trade-off, the instruction window is particularly important. This is because enlarging the window size achieves high performance but naive scaling of the conventional instruction window can severely increase the complexity and power consumption. In this pape...
متن کاملA Reorder Buffer Design for High Performance Processors
Modern reorder buffers (ROBs) were conceived to improve processor performance by allowing instruction execution out of the original program order and run ahead of sequential instruction code exploiting existing instruction level parallelism (ILP). The ROB is a functional structure of a processor execution engine that supports speculative execution, physical register recycling, and precise excep...
متن کاملEnergy – Efficient Design of the Reorder Buffer 1
Some of today’s superscalar processors, such as the Intel Pentium III, implement physical registers using the Reorder Buffer (ROB) slots. As much as 27% of the total CPU power is expended within the ROB in such designs, making the ROB a dominant source of power dissipation within the processor. This paper proposes three relatively independent techniques for the ROB power reduction with no or mi...
متن کاملThe Impact of Fetch Rate and Reorder Buffer Size on Speculative Pre-Execution
Pre-execution systems reduce the impact of cache misses and branch mispredictions by forking a slice, a code fragment derived from the program, in advance of frequently mispredicted branches and frequently missing loads in order to either resolve the branch or prefetch the load. Because unnecessary instructions are omitted the slice reaches the branch or load before the main thread does, for lo...
متن کاملAn Adaptive Technique for Reducing Leakage and Dynamic Power in Register Files and Reorder Buffers
Contemporary superscalar processors, designed with a one-size-fitsall philosophy, grossly overcommit significant portions of datapath resources that remain unnecessarily activated in the course of program execution. We present a simple scheme for selectively activating regions within the register file and the reorder buffer for reducing leakage as well as dynamic power dissipation. Our techniqu...
متن کامل